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Fusion proteins containing N-terminal fragments of human 
serum albumin 



The present invention relates to fusion polypeptides where 
two individual polypeptides or parts thereof are fused to 
form a single amino acid chain. Such fusion may arise 
from the expression of a single continuous coding sequence 
formed by recombinant DNA techniques. 

Fusion polypeptides are known, for example those where a 
polypeptide which is the ultimately desired product of the 
process is expressed with an N-terminal "leader sequence " 
which encourages or allows secretion of the polypeptide 
from the cell. An example is disclosed in EP-A-116 201 
( Chiron ) . 

Human serum albumin (HSA) is a known protein found in the 
blood. EP-A-147 198 (Delta Biotechnology) discloses its 
expression in a transformed host, in this case yeast. Our 
earlier application EP-A-322 094 discloses N-terminal 
fragments of HSA/ namely those consisting of residues 1-n 
where n is 369 to 419, which have therapeutic utility. 
The application also mentions the possibility of fusing 
the C-terroinal residue of such molecules to other, 
unnamed , polypeptides . 
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One aspect of the present invention provides a fusion 
polypeptide comprising, as at least part of the N- terminal 
portion thereof, an N-terminal portion of HSA or a variant 
thereof and, as at least part of the C- terminal portion 
thereof, another polypeptide except that, when the said N- 
terminal portion of HSA is the 1-n portion where n is 369 
to 419 or a variant thereof then the said 

polypeptide is (a) the 585 to 1578 portion of human 
fibronectin or a variant thereof, (b) the 1 to 368 portion 
of CD4 or a variant thereof, (c) platelet derived growth 
factor, or a variant thereof, (d) transforming growth 
factor, or a variant thereof, (e) the 1-261 portion of 
mature human plasma fibronectin or a variant thereof, (f) 
the 278-578 portion of mature human plasma fibronectin or 
a variant thereof, (g) the 1-272 portion of mature human 
von Willebrand's Factor or a variant thereof, or (h) 
alpha-l-antitrypsin or a variant thereof. 

The N-terminal portion of HSA is preferably the said 1-n 
portion, the 1-177 portion (up to and including the 
cysteine), the 1-200 portion (up to but excluding the 
cysteine) or a portion intermediate 1-177 and 1-200. 
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The term "human serum albumin" (HSA) is intended to 
include (but not necessarily to be restricted to) known or 
yet-to-be-discovered polymorphic forms of HSA. For 
example, albumin Naskapi has Lys-372 in place of Glu-372 
and pro-albumin Christchurch has an altered pro-sequence . 
The term "variants" is intended to include (but not 
necessarily to be restricted to) minor artificial 
variations in sequence (such as molecules lacking one or a 
few residues, having conservative substitutions or minor 
insertions of residues, or having minor variations of 
amino acid structure). Thus polypeptides which have 80%, 
preferably 85% , 90%, 95% or 99% , homology with KSA are 
deemed to be "variants". It is also preferred for such 
variants to be physiologically equivalent to HSA? that is 
to say, variants preferably share at least one 
pharmacological utility with HSA. Furthermore, any 
putative variant which is to be used pharmacologically 
should be non-immunogenic in the animal (especially human) 
being treated. 

Conservative substitutions are those where one or more 
amino acids are substituted for others having similar 
properties such that one skilled in the art of polypeptide 
chemistry would expect at least the secondary structure, 
and preferably the tertiary structure, of the polypeptide 
to be substantially unchanged. For example, typical such 
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substitutions include asparagine for glutamine, serine for 
asparagine and arginine for lysine. Variants may 
alternatively, or as veil, lack up to ten (preferably only 
one or two) intermediate amino acid residues (ie not at 
the termini of the said N- terminal portion of HSA) in 
comparison with the corresponding portion of natural HSA; 
preferably any such omissions occur in the 100 to 369 
portion of the molecule (relative to mature HSA itself) 
(if present). Similarly, up to ten, but preferably only 
one or two, amino acids may be added, again in the 100 to 
369 portion for preference (if present). The term 
"physiologically functional equivalents " also encompasses 
larger molecules comprising the said sequence plus a 
further sequence at the N-terminal (for example, pro-HSA, 
pre-pro-HSA and met-HSA) . 

Clearly, the said "another polypeptide 11 in the fusion 
compounds of the invention cannot be the remaining portion 
of HSA, since otherwise the whole polypeptide would be 
HSA, which would not then be a "fusion polypeptide". 

Even when the HSA-like portion is not the said 1-n portion 
of HSA, it is preferred for the non-HSA portion to be one 
of the said (a) to (h) entities. 
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The 1 to 368 portion of CD 4 represents the first four 
disulphide-linked immunoglobulin- like domains of the human 
T lymphocyte CD4 protein, the gene for and amino acid 
sequence of which are disclosed in D. Smith et al (1987) 
Science 328 , 1704-1707. It is used to combat HIV 
infections . 

The sequence of human platelet-derived growth factor 
{ PDGF ) is described in Collins et al (1985) Nature 316, 
748-750. Similarly, the sequence of transforming growth 
factors p (TGF-J3) is described in Derynck et al (1985) 
Nature 316 , 701-705. These growth factors are useful for 
wound -healing . 

A cDNA sequence for the 1-261 portion of Fn was disclosed 
in EP-A-207 751 (obtained from plasmid pFH6 with 
endonuclease PyuII). This portion binds fibrin and can be 
used to direct fused compounds to blood clots . 

A cDNA sequence for the 278-578 portion of Fn, which 
contains a collagen-binding domain, was disclosed by R.J. 
Owens and F.E. Baralle in 1986 E.M. B.O.J. 5, 2825-2830. 
This portion will bind to platelets. 
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The 1-272 portion of von Wiilebrand's Factor binds and 
stabilises factor VIII, The sequence is given in Bontham 
et al , Nucl. Acids Res. 14, 7125-7127. 

Variants of alpha- 1 -antitrypsin include those disclosed by 
Rosenburg et al (1984) Nature 312 , 77-80. In particular, 
the present invention includes the Pittsburgh variant 
(Met 358 is mutated to Arg) and the variant where Pro 357 
and Met 3 5 8 are mutated to alanine and arginine 
respectively. These compounds are useful in the treatment 
of septic shock and lung disorders. 

Variants of the non-HSA portion of the polypeptides of the 
invention include variations as discussed above in 
relation to the HSA portion, including those with 
conservative amino acid substitutions , and also homologues 
from other species. 

The fusion polypeptides of the invention may have N- 
terminal amino acids which extend beyond the portion 
corresponding to the N- terminal portion of HSA. For 
example, if the HSA-like portion corresponds to an N- 
terminal portion of mature HSA, then pre-, pro-, or pre- 
pro sequences may be added thereto, for example the yeast 
alpha-factor leader sequence. The fused leader portions 
of WO 90/01063 may be used. The polypeptide which is 
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fused to the HSA portion may be a naturally-occurring 
polypeptide, a fragment thereof or a novel polypeptide, 
including a fusion polypeptide. For example, in Example 3 
below, a fragment of fibronectin is fused to the RSA 
portion via a 4 amino acid linker. 

It has been found that the amino terminal portion of the 
HSA molecule is so structured as to favour particularly 
efficient translocation and export of the fusion compounds 
of the invention in eukaryotic cells. 

A second aspect of the invention provides a • transformed 
host having a nucleotide sequence so arranged as to 
express a fusion polypeptide as described above. By "so 
arranged", we mean, for example, that the nucleotide 
sequence is in correct reading frame with an appropriate 
RNA polymerase binding site and translation start sequence 
and is under the control of a suitable promoter. The 
promoter may be homologous with or heterologous to the 
host. Downstream (3') regulatory sequences may be 
included if desired, as is known. The host is preferably 
yeast (for example Saccharomyces spp. , e.g. S. cerevisiae; 
Kluweromvces spp., e.g. K. lactis ; Pichia spp.; or 
Schizosaccharomvces spp., e.g. S. pombe ) but may be any 
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other suitable host such as E . coli , B. subtilis, 
Aspergillus spp. , mammalian cells, plant cells or insect 
cells . 

A third aspect of the invention provides a process for 
preparing a fusion polypeptide according to the first 
aspect of the invention by cultivation of a transformed 
host according to the second aspect of the invention, 
followed by separation of the fusion polypeptide in a 
useful form. 

A fourth aspect of the invention provides therapeutic 
methods of treatment of the human or other animal body 
comprising administration of such a fusion polypeptide. 

In the methods of the invention we are particularly 
concerned to improve the efficiency of secretion of useful 
therapeutic human proteins from yeast and have conceived 
the idea of fusing to amino- terminal portions of HSA those 
proteins which may ordinarily be only inefficiently 
secreted. One such protein is a potentially valuable 
wound-healing polypeptide representing amino acids 585 to 
1578 of human fibronectin (referred to herein as Fn 585- 
1578). As we have described in a separate application 
(filed simultaneously herewith) this molecule contains 
cell spreading, chemotactic and chemokinetic activities 
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present invention wherein the C-terminal portion is Fn 
585-1578 can be used for wound healing applications as 
biosynthesised, especially where the hybrid human protein 
will be topically applied. However, the portion 

representing amino acids 585 to 1578 of human fibronectin 
can if desired be recovered from the fusion protein by 
preceding the first amino acid of the fibronectin portion 
by amino acids comprising a factor X cleavage site. After 
isolation of the fusion protein from culture supernatant, 
the desired molecule is released by factor X cleavage and 
purified by suitable chromatography (e.g. ion-exchange 
chromatography). Other sites providing for enzymatic or 
chemical cleavage can be provided, either by appropriate 
juxtaposition of the N-terminal and C-terminal portions or 
by the insertion therebetween of an appropriate linker. 

At least some of the fusion polypeptides of the invention, 
especially those including the said CD 4 and vWF fragments, 
PDGF and ct^AT, also have an increased half -life in the 
blood and therefore have advantages and therapeutic 
utilities themselves, namely the therapeutic utility of 
the non-HSA portion of the molecule. In the case of a 2 AT 
and others, the compound will normally be administered as 
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a one-off dose or only a few doses over a short: period, 
rather than over a long period, and therefore the 
compounds are less likely to cause an immune response. 

EXAMPLES : SUMMARY 

Standard recombinant DNA procedures were as described by 
Maniatis et al (1982 and recent 2nd edition) unless 
otherwise stated. Construction and analysis of phage M13 
recombinant clones was as described by Messing (1983) and 
Sanger et al (1977). 

DNA sequences encoding portions of human serum albumin 
used in the construction of the following molecules are 
derived from the plasmids mH0B12 and pDBD2 (EP-A-322 094, 
Delta Biotechnology Ltd, relevant portions of which are 
reproduced below) or by synthesis of oligonucleotides 
equivalent to parts of this sequence. DNA sequences 
encoding portions of human fibronectin are derived from 
the plasmid pFHDELl, or by synthesis of oligonucleotides 
equivalent to parts of this sequence. Plasmid pFHDELl , 
which contains the complete human cDNA encoding plasma 
fibronectin, was obtained by ligation of DNA derived from 
plasmids pFH6, 16, 54, 154 and 1 (EP-A-207 751; Delta 
Biotechnology Ltd) . 
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This DNA represents an mRNA variant which does not contain 
the ' ED' sequence and had an 89-amino acid variant of the 
III-CS region (R.J. Owens, A.R . Kornblihtt and F.E. 
Baralle (1986) Oxford Surveys on Eukaryotic Genes 3 141- 
160). The map of this vector is disclosed in Fig. 11 and 
the protein sequence of the mature polypeptide produced by 
expression of this cDNA is shown in Fig. 5. 

Oligonucleotides were synthesised on an Applied Biosystems 
380B oligonucleotide synthesiser according to the 
manufacturer's recommendations (Applied Biosystems, 
Warrington, Cheshire, UK). 

An expression vector was constructed in which DNA encoding' 
the HSA secretion signal and mature HSA up to and 
including the 387th amino acid, leucine, fused in frame to 
DNA encoding a segment of human fibronectin representing 
amino acids 585 to 1578 inclusive, was placed downstream 
of the hybrid promoter of EP-A-258 06 7 (Delta 
Biotechnology), which is a highly efficient galactose- 
inducible promoter functional in Saccharom vces cerevisiae. 
The codon for the 157 8th amino acid of human fibronectin 
was directly followed by a stop codon (TAA) and then the 
S. cerevisiae phosphoglycerate kinase (PGK) gene 
transcription terminator. This vector was then introduced 
i nro S . cerevisiae by transformation, wherein it directed 
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the expression and secretion from the cells of a hybrid 
molecule representing the N-terminal 387 amino acids of 
HSA C-terminally fused to amino acids 585 to 1578 of human 
f ibronectin . 

In a second example a similar vector is constructed so as 
to enable secretion by S. cerevisiae of a hybrid molecule 
representing the N-terminal 195 amino acids of HSA C- 
terminally fused to amino acids 585 to 1578 of human 
f ibronectin. 

Aspects of the present invention will now be described by 
way of example and with reference to the accompanying 
drawings, in which: 

Figure 1 (on two sheets) depicts the amino acid sequence 
currently thought to be the most representative of natural 
HSA, with (boxed) the alternative C-termini of HSA(l-n); 

Figure 2 (oh two sheets) depicts the DNA sequence coding 
for mature HSA, wherein the sequence included in Linker 3 
is underlined; 

Figure 3 illustrates, diagrammatically, the construction 
of mH0B16; 
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Figure 4 illustrates, dia grammatically , the construction 
of pHOB31; 

Figure 5 (on 6 sheets) illustrates the mature protein 
sequence encoded by the Fn plasmid pFHDELl; 

Figure 6 illustrates Linker 5, showing the eight 
constituent oligonucleotides; 

Figure 7 shows schematically the construction of plasmid 
pDBDF2 ; 

Figure 8 shows schematically the construction of plasmid 
pDBDF5; 

Figure 9 shows schematically the construction of plasmid 
pDBDF9; 

Figure 10 shows schematically the construction of plasmid 
DBDF12 , using plasmid pFHDELl; and 

Figure 11 shows a map of plasmid pFHDELl. 
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EXAMPLE 1 : HSA 1-387 FUSED TO Fn 585-1578 

The following is an account of a preparation of plasmids 
comprising sequences encoding a portion of HSA, as is 
disclosed in EP-A-322 094. 

The human serum albumin coding sequence used in the 
construction of the following molecules is derived from 
the plasmid M13mpl9.7 (EP-A-201 239, Delta Biotech- nology 
Ltd.) or by synthesis of oligonucleotides equivalent to 
parts of this sequence. Oligonucleotides were synthesised 
using phosphoramidite chemistry on an Applied Biosystems 
380B oligonucleotide synthesizer according to the 
manufacturer's recommendations (AB Inc., Warrington, 
Cheshire, England) . 

An oligonucleotide was synthesised (Linker A), which 
represented a part of the known HSA coding sequence 
(Figure 2) from the PstI site (1235-1240, Figure 2) to the 
codon for valine 381 wherein that codon was changed from 
GTG to GTC: 
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D 


P 


H 


E 


C 


Y 


GAT 


CCT 


CAT 


GAA 


TGC 


TAT 


CTA 


GGA 


GTA 


CTT 


ACG 


ATA 



1247 



A 


K 


V. 


F 


D 


E 


F 


K 


GCC 


AAA 


GTG 


TTC 


GAT 


GAA 


TTT 


AAA 


CGG 


TTT 


CAC 


AAG 


CTA 


CTT 


AAA 


TTT 






1267 












P 


L 


V 












CTT • 


GTC 


3' 












GGA 


CAG 


5' 












Linker 


1 was 


ligated into 


the 


vector 


M13mpl9 


(NC 



et al , 1983) which had been digested with PstI and Hin di 
and the ligation mixture was used to transf ect E . coli 
strain XLl-Blue (Stratagene Cloning Systems, San Diego, 
CA) . Recombinant clones were identified by their failure 
to evolve a blue colour on medium containing the 
chromogenic indicator X-gal ( 5-bromo-4-chloro-3-indolyl-£- 
D-galactoside) in the present of IPTG ( isopropylthio-0- 
galactoside) . DNA seguence analysis of template DNA 
prepared from bacteriophage particles of recombinant 
clones identified a molecule with the required DNA 
sequence, designated mHOB12 (Figure 3). 
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K13mpl9.7 consists of the coding region of mature HSA in 
M13mpl9 (Norrander et al , 1983) such that the codon for 
the first amino acid of HSA, GAT, overlaps a unique Xhol 
site thus: 

Asp Ala. 
5 ' CTCGAGATGCA 3 ' 

3' GAGCTCTACGT 5' 

Xho l 

(EP-A-210 239). K13mpl9.7 was digested with Xho l and made 
flush-ended by SI -nuclease treatment and was then 
ligated with the following oligonucleotide (Linker 2): 

Linker 2 

5'TCTTTTATCCAAGCTTGGATAAAAGA 3' 
3 ' AGAAAATAGGTTCGAACCTATTTTCT 5' 

Hindiri 

The ligation mix was then used to transfect E.coli XL1- 
Blue and template DNA was prepared from several plaques 
and then analysed by DNA sequencing to identify a clone, 
pDBDl (Figure 4), with the correct sequence. 
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A 1.1 kb Hin di 1 1 to Pst I fragment representing the 5' end 
of the HSA coding region and one half of the inserted 
oligonucleotide linker was isolated from pDBDl by agarose 
gel electrophoresis. This fragment was then ligated with 
double stranded mHOB12 previously digested with Hin di I I 
and Pst I and the ligation mix was then used to transfect 
E.coli XLl-Blue. Single stranded template DNA was 
prepared from mature bacteriophage particles of several 
plagues. The DNA was made double stranded in vitro by 
extension from annealed sequencing primer with the Klenow 
fragment of DNA polymerase I in the presence of 
deoxynucleoside triphosphates. Restriction enzyme 

analysis of this DNA permitted the identification of a 
clone with the correct configuration, mHOB15 (Figure 4). 

The following oligonucleotide (Linker 3) represents from 
the codon for the 382nd amino acid of mature HSA 
(glutamate, GAA) to the codon for lysine 389 which is 
followed by a stop codon (TAA) and a Hin di 1 1 site and then 
a BamHI cohesive end: 

Linker 3 

EEPQNLIKJ 
5' GAA GAG CCT CAG AAT TTA ATC AAA TAA GCTTG 3' 
3 ' CTT CTC GGA GTC TTA AAT TAG TTT ATT CGAACCTAG 5 ' 
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This was ligated into double stranded mHOBIS, previously- 
digested with Hind i and BamH I . After ligation, the DNA 
was digested with Hin di to destroy all non-recombinant 
molecules and then used to transfect E.coli XLl-Blue. 
Single stranded DNA was prepared from bacteriophage 
particles of a number of clones and subjected to DNA 
sequence analysis. One clone having the correct DNA 
sequence was designated mH0B16 (Figure 4). 

A molecule in which the mature HSA coding region was fused 
to the HSA secretion signal was created by insertion of 
Linker 4 into Bam HI and Xho l digested M13mpl9.7 to form 
pDBD2 (Figure 4). 

Linker 4 





M 


X 


W 


V 


S 


F 


5' GATCC 


ATG 


AAG 


TGG 


GTA 


AGC 


TTT 


G 


TAC 


TTC 


ACC 


CAT 


TCG 


AAA 



I 


S 


L 


L 


F 


L 


F 


S 


ATT 


TCC 


CTT 


CTT 


TTT 


CTC 


TTT 


AGC 


TAA 


AGG 


GAA 


GAA 


AAA 


GAG 


AAA 


TCG 
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s 


A 


Y 


S 


R 


G 


V 


F 


TCG 


GCT 


TAT 


TCC 


AGG 


GGT 


GTG 


TTT 


AGC 


CGA 


ATA 


AGG 


TCC 


CCA 


CAC 


AAA 



R R 
CG 3' 
GCAGCT 5 ' 

In this linker the codon for the fourth amino acid after 
the initial methionine, ACC for threonine in the HSA pre- 
pro leader sequence (Lawn et al , 1981), has been changed 
to AGC for serine to create a Hin di I I site. 

A sequence of synthetic DNA representing a part of the 
known HSA coding sequence (Lawn et al > , 1981) (amino acids 
382 to 387, Fig. 2), fused to part of the known 
fibronectin coding sequence (Kornblihtt et al . , 1985) 
(amino acids 585 to 640, Fig. 2), was prepared by 
synthesising six oligonucleotides (Linker 5, Fig. 6). The 
oligonucleotides 2, 3, 4, 6, 7 and 8 were phosphorylated 
using T4 polynucleotide kinase and then the 
oligonucleotides were annealed under standard conditions 
in pairs, i.e. 1+8, 2+7, 3+6 and 4+5. The. annealed 
oligonucleotides were then mixed together and ligated with 
mHOB12 which had previously been digested with the 
restriction enzymes Hin di and EcoRI . The ligation 
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mixture was then used to transfect E.coli XLl-Blue 
(Stratagene Cloning Systems, San Diego, CA) . Single 
stranded template DNA was then prepared from mature 
bacteriophage particles derived from several independent 
plaques and then was analysed by DNA sequencing. A clone 
in which a linker of the expected sequence had been 
correctly inserted into the vector was designated pDBDFl 
(Fig- 7). This plasmid was then digested with Pst I and 
EcoRI and the approx. 0-24kb fragment was purified and 
then ligated with the 1.29kb BamKI-Pstl fragment of pDBD2 
(Fig. 7) and BamHI + EcoRI digested pUC19 (Yani sen- Perron, 
et al ., 1985) to form pDBDF2 (Fig. 7). 

A plasmid containing a DNA sequence encoding full length 
human fibronectin, pFHDELl, was digested with EcoR I and 
Xhol and a 0.77kb EcoR I- Xho I fragment (Fig. 8) was 
isolated and then ligated with Eco RI and Sai l digested Ml 3 
mp!8 (Norrander et al ., 1983) to form pDBDF3 (Fig. 8). 

The following oligonucleotide linker (Linker 6) was 
synthesised, representing from the Pst I site at 4784-4791 
of the fibronectin sequence of EF-A-207 751 to the codon 
for tyrosine 1578 (Fig. 5) which is followed by a stop 
codon (TAA) , a Hind i 1 1 site and then a BamH I cohesive end: 
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Linker 6 

G p.DQTEMTIEGL 
GGT CCA GAT CAA ACA GAA ATG ACT ATT GAA GGC TTG 
A CGT CCA GGT CTA GTT TGT CTT TAC TGA TAA CTT CCG AAC 

Q p T V E Y Stop 
CAG CCC ACA .GTG GAG TAT TAA GCTTG 

GTC GGG TGT CAC CTC ATA ATT CGAACCTAG 

This linker was then ligated with Pst I and Hindlll 
digested pDBDF3 to form pDBDF4 (Fig. 8).. The following 
DNA fragments were then ligated together with Bql ll 
digested pKVSO (EP-A-258 067) as shown in Fig. 8: 0.68kb 
Eco RI- Bam HI fragment of pDBDF4, 1 . 5kb BamH I- Stu I fragment 
of pDBDF2 and the 2 . 2kb Stu I- EcoR I fragment of pFHDELl . 
The resultant plasmid pDBDFS (Fig. 8) includes the 
promoter of EP-A-258 067 to direct the expression of the 
HSA secretion signal fused to DNA encoding amino acids 1- 
387 of mature HSA, in turn fused directly and in frame 
with DNA encoding amino acids 585-1578 of human 
fibronectin, after which translation would terminate at 
the stop codon TAA. This .is then followed by the 
S. cerevisiae PGK gene transcription terminator. The 
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plasmid also contains sequences which permit selection and 
maintenance in Escherichia coli and S.cerevisiae (EP-A-258 
067). 

This plasmid was introduced into S.cerevisiae S150-2B 
( leu2-3 leu2-112 ura3-52 trol-289 his3- I ) by standard 
procedures (Beggs, 1978). Transf ormants were subsequently 
analysed and found to produce the HSA-f ibronectin fusion 
protein . 

EXAMPLE 2 ; HSA 1-195 FUSED TO Fn 585-1578 

In this second example the first domain of human serum 
albumin (amino acids 1-195) is fused to amino acids 585- 
1578 of human f ibronectin. 

The plasmid pDBD2 was digested with BamH I and Bgl ll and 
the 0.79kb fragment was purified and then ligated with 
BamH I -digested M13mpl9 to form pDBDF6 (Fig. 6). The 
following oligonucleotide: 

5'-C CAAAG CT CGAG GAA CT T C G-3 ' 

was used as a mutagenic primer to create a Xho l site in 
pDBDF6 by in vitro mutagenesis using a kit supplied by 
Amersham International PLC. This site was created by 
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changing base number 696 of HSA from a T to a G (Fig. 2). 
The plasmid thus formed was designated pDBDF7 (Fig. 9). 
The following linker was then synthesised to represent 
from this newly created Xhol site to the codon for lysine 
195 of HSA (AAA) and then from the codon for isoleucine 
585 of fibronectin to the ends of oligonucleotides 1 and 8 
shown in Fig. 6. 

Linker 7 

DELRDEGKAS.SAK 
TC GAT GAA CTT CGG GAT GAA GGG AAG GCT TCG TCT GCC AAA 
A CTT GAA GCC CTA CTT CCC TTC CGA AGC AGA CGG TTT 

ITETPSQPNSH 
ATC ACT GAG ACT CCG AGT CAG C 

TAG TGA CTC TGA GGC TCA GTC GGG TTG AGG GTG -G 

This linker was ligated with the annealed oligonucleotides 
shown in Fig. 3, i.e. 2+7, 3+6 and 4+5 together with Xho l 
and EcoRI digested pDBDF7 to form pDBDF8 (Fig. 9). Note 
that in order to recreate the original HSA DNA sequence, 
and hence amino acid sequence, insertion of linker 7 and 
the other oligonucleotides into pDBDF7 does not recreate 
the Xho l site. 
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The CK83kb BamHI-StuI fragment of pDBDF8 was purified and 
then was ligated with the 0 . 58kb EcoRI -BamHI fragment of 
pDBDF2 and the 2.22kb StuI -EcoRI fragment of pFHDELl into 
Bql ll-diqested pKV50 to form pDBDF9 (Fig. 9). This 
plasmid is similar to pDBDFS except that it specifies only 
residues 1-195 of HSA rather than 1-387 as in pDBDFS. 

When introduced into S. cerevisiae S150-2B as above, the 
plasmid directed the expression and secretion of a hybrid 
molecule composed of residues 1-195 of HSA fused to 
residues 585-1578 of fibronectin. 

EXAMPLE 3 : HSA 1-387 FUSED TO Fn 585-1578 , AS CLEAVABLB 
MOLECULE 

In order to facilitate production of large amounts of 
residues 585-1578 of fibronectin , a construct was made in 
which DNA encoding residues 1-387 of HSA was separated 
from DNA encoding residues 585-1578 of fibronectin by the 
sequence 

I E G R 
ATT GAA GGT AGA 
TAA CTT CCA TCT 
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which specifies the cleavage recognition site for the 
blood clotting Factor X. Consequently the purified 
secreted product can be treated with Factor X and then the 
fibronectin part of the molecule can be separated from the 
HSA part. 

To do this two oligonucleotides were synthesised and then 
annealed to form Linker 8 . 

Linker 8 



E 


E 


p 


Q 


N 


L • 


I 


E 


G 


GAA 


GAG 


CCT 


CAG 


AAT 


TTA 


ATT 


GAA 


GGT 


CTT 


CTC 


GGA 


GTC 


TTA 


AAT 


TAA 


CTT 


CCA 


R 


I 


T 


E 


T 


P 


S 


Q 


P 


AGA 


ATC 


ACT 


GAG 


ACT 


CCG 


AGT 


CAG 


C 


TCT 


TAG 


TGA 


CTC 


TGA 


GGC 


TCA 


GTC 


GGG 


N 


S 


H 














TTG 


AGG 


GTG 


G 













This linker was then ligated with the annealed 
oligonucleotides shown in Fig. 6, i.e. 2+7, 3+6 and 4+5 
into Hindi andEcoRI digested mHOB12, to form pDBDFlO 
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(Fig. 7). The plasmid was then digested with Pst I and 
EcoRI and the roughly 0.24kb fragment was purified and 
then ligated with the 1.29kb BamH I- Pst I fragment of pDBD2 
and BamHI and EcoRI digested pUC19 to form pDBDFll (Fig. 
10) . 

The 1.5kb BamH I -Stu I fragment of pDBDFll was then ligated 
with the 0.68kb EcoR I - BamH I fragment of pDBDF4 and the 
2.22kb StuI -EcoRI fragment of pFHDELl into Boll I -digested 
pKV50 to form pDBDF12 (Fig. 10). This plasmid was then 
introduced into S.cerevisiae S150-2B. The purified 
secreted fusion protein was treated with Factor X to 
liberate the fibronectin fragment representing residues 
585-1578 of the native molecule. 
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CLAIMS 



1. A fusion polypeptide comprising, as at least part of 
the N- terminal portion thereof , an N- terminal portion 
of HSA or a variant thereof and, as at least part of 
the C-terminal portion thereof, another polypeptide 
except that, when the said N-terminal portion of HSA 
is the 1-n portion where n is 369 to 419 or a 
variant thereof then the said polypeptide is (a) the 
585 to 1578 portion of human fibronectin or a variant 
thereof, (b) the 1 to 368 portion of CD 4 or a variant 
thereof, (c) platelet derived growth factor or a 
variant thereof, (d) transforming growth factor £ or 
a variant thereof, (e) the 1-261 portion of mature 
human plasma fibronectin or a variant thereof, (f) 
the 278-578 portion of mature human plasma 
fibronectin or a variant thereof, (g) the . 1-272 
portion of mature human von Willebrand ' s Factor or a 
variant thereof, or (h) alpha- 1 -antitrypsin or a 
variant thereof. 
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2. A fusion polypeptide according to Claim 1 
additionally comprising at least one N-terminal amino 
acid extending beyond the portion corresponding to 
the N- terminal portion of HSA. 

3. A fusion polypeptide according to Claim 1 or 2 
wherein there is a cleavable region at the junction 
of the said N-terminal or C- terminal portions. 

4. A fusion polypeptide according to any one of the 
preceding claims wherein the said C-terminal portion 
is the 585 to 1578 portion of human plasma 
f ibronectin or a variant thereof . 

5. A transformed or transfected host having a nucleotide 
sequence so arranged as to express a fusion 
polypeptide according to any one of the preceding 
claims. 

6. A process for preparing a fusion polypeptide by 
cultivation of a host according to Claim 5, followed 
by separation of the fusion polypeptide in a useful 

. form. 

7 . A fusion polypeptide according to any one of Claims 1 
to 4 for use in therapy. 



WO 90/13653 

FIGURE 1 



1/18 



PCT/GB90/006SO 



10 20 
Aso Ala His lys Ser Glu vai Ale His Arg ?he Lys As? Leu Gly C-lu Glu Asr. ?r.e Lys 

30 « 
Ala Leu val leu lie Ala ?r.e Ala Gin Tyr Leu Gin Gin Cys Pro Phe Glu Asp H.s Val 

50 50 
Lys Leu Vai Asn Glu Vai Thr Glu Phe Ala Lys Thr Cys Vai Ala Asp Glu Ser Ala Glu 

70 90 
Asn Cys As? Lys Ser Leu His Thr Leu Phe Gly As? Lys Leu Cys Thr vai Ala Thr Leu 

90 ■ CC 

Arc Glu Thr Tyr Gly Glu Met Ala As? Cys Cys Ala Lys Gin Glu Pro Glu Arc .Asn Glu 

no 12: 

Cvs ?ne Leu Gin His Lys As? As? Asn Pro Asn Leu Pro Arc Leu Vai Arc Pro Glu Val 

130 i40 
Asp Vai Met Cys Thr Ala Phe His As? Asn Glu Glu Thr Phe Leu Lys Lys Tyr Leu Tyr 

150 ISO 
Glu lis Ala Arg Arc His Pro Tyr Phe Tyr Ala Pro Glu Leu Leu Phe Phe Ala Lys Arc 

170 '80 
Tyr Lys Ala Ala Phe Thr Glu Cys Cys Gin Ala Ala Asp Lys Ala Ala Cys Leu Leu Pro 

190 200 
Lys leu Asp Giu Leu Arc Asp Glu Gly Lys Ala Ser Ser Ala Lys Gin Arc Leu Lys Cys 

210 220 
Ala Ser Leu Gin Lys Phe Gly Giu Arg. Ala Phe Lys Ala Trp Ala Vai Ala Arg Leu Ser 

230 240 
Gin Arg Phe Pro Lys Ala Glu Phe Ala Glu Vai Ser Lys Leu Vai Thr Asp Leu Thr Lys 

250 250 
Vai His Thr Giu Cys Cys His Gly As? Leu Leu Glu Cys Ala Asp Asp Arg Ala As? Leu 

270 28C 
Ala Lys Tyr lie Cys Glu Asn Gin As? Ser lie Ser Ser Lys Leu Lys Giu Cys Cys Giu 

2S0 300 
Lys Pro Leu Leu Giu Lys Ser His Cys lie Ala Glu Val Giu Asn As? Glu Met Pro Ala 

310 320 
As? Leu Pro Ser Leu Ala Ala As? Phe vai Giu Ser Lys As? Val cys Lys Asn Tyr Ala 

330 340 
Giu Ala Lys As? Vai Phe Leu Gly Men Phe Leu Tyr Giu Tyr Ala Arc Arc His Pro Asp 

250 250 
Tyr Ser Vai Vai Leu leu leu Arg Leu Ala lys Thr Tyr Glu Thr Tnr leu Glu Lys Cys 



Cys Ala Ala Ala Asp Pre His Gl: 



Cys Tyr Ala lys Val Phe As? Glu .Pre lys Pro Lei 
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"GuHZ 1 Csrvr. 



— TSu" ~ — -oo 

Val Glu Giu Pro Gin Asn Leu lie Lys Gin Asn Cys Glu Leu Phe Glu Glr. Leu Giv Glu 



420 



- " 4T6 

?yr Lys ?he 'Gin Asa Ala Leu Leu Vai Arg Tyr Thr Lys Lys Vai Pro Gin VaiSer 



430 «0 
Pro Thr Leu Val Glu Val Ser Arg Asn Leu Gly Lys Val Gly Ser Lys Cys Cys Lys His 

450 460 
Pro Glu Ala Lys Arg Met Pro Cys Ala Glu As? Tyr Leu Ser Val Val Leu Asn Gin Leu 

470 4SC 
Cys Vai Leu His Giu Lys Thr Pro Vai Ser As? Arc Vai ?hr Lys Cys Cys Thr Glu Ser 

490 500 
Leu Vai Asn Arc Arg Pro Cys • Phe Ser Ala Leu Giu Vai As? Glu T. w _r Tyr Val Pro Lys 

510 520 
Giu Phe Asn Aia Giu Thr Phe Thr Phe His Ala As? lie Cys Thr Leu Ser Giu Lys Giu 

530 540 
Arg Gin lie Lys Lys Gin Thr Ala Leu Vai Giu Leu Vai Lys Kis Lys Pro Lys Aia Thr 

* 550 560 
Lys Giu Gin Leu Lys Aia Vai Me- As? As? Phe Aia Aia Phe Val Giu Lys Cys Cys Lys 

570 580 
Ala As? As? Lys Giu Thr Cys Phe Ala Giu Giu Gly Lys Lys Leu Vai Aia Ala Ser Gin 

Aia Ala Leu Gly Leu 
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FIGURE 2 DNA sequence coding for mature H5A 



10 20 3C 40 50 60 7C 80 

GA7GCACACAAGAG7GAGG77GC7CA7CGC777AAAGA7T7GGGAGAAGAAAA777CAAAGC 

D A H X £ Z V A H K T X D L G 2 2 N T X A L V L Z A ? 

90 100 .10 120 130 140 150 160 

TGC7CAGTATCTTCAGCAG7CTCCATTTGAAGATCATGTAAAATTAGTC 

AQYLQQC??2DBVXLVN2V7Z~AXTC 

-.70 180 190 200 210 220 230 24C 

TTGC7GA7GAG7CAGCTGAAAA77G7GACAAA7CAC77CA7ACCC7777TGGAGACAAA77A7GCACAG77G^ 
VAD25AENCDX5LH7I.PGDXLCTVA7L 

250 260 270 280 290 300 310 32C 

CS7GAAACCTATGG7CAAATGGCTGAC7GCTGTGCAAAACAACAACCTGAGAGAA 
RiTYG2KADCCAXQ2P2RN2Cr:,QKX2 

230 340 350 360 370 380 390 4CG 

TGACAACCCAAACC7CCCCCGA77GG7GAGACCAGAGG77GA7G7GATG7GCAC7GC7777CA7GACAATGAAGAGAtt 
DN?Ni.PRI*VR?2VDVMC7A?KDNZ27 

410 420 430 440 450 

7777GAAAAAA7AC77ATA7GAAA77GCCAGAAGACA7CC77AC7777A7GC 
F 1 X X Y L Y2IARRHPYPYA 

490 500 510 520 530 540 550 560 

TATAAAGC7GC777TACAGAA7G77GCGAAGC7GC7GA7AAAGC7GCC7GCC7G77GCCAAAGC7CGA7G 

Y XAA?T2CCQAADXAAC LI#?XI.32iRD 

570 580 596 600 610 620 630 640 

TGAAGGGAAGGC77CG7C7GCCAAACAGAGAC7CAAA7G7GCCAG7C7CCAAAAA777 

Z G X A 5 SAXQRLXCASLQXP G 2 R A F X A 

£5C 660 670 630 690 70C 710 720 

GGGCASTGGC7CGCC7GAGCCAGAGA7TTCCCAAAGCT 

WAVARLSQXr?XA2?A2VSX-V?DlTK 

730 740 750 760 770 780 790 600 

GTCCAC^CGGAA7GCrCCCA7GGAGA7C7GCT7GAA7G7GC7GA7GACAGGGCGGACC77GCCAAGTATATCTC7GAAAA 

V £ 7 2 C C K GDI«1»2CADDRAD L A X Y 2 C ■ 2 N 

810 820 830 840 850 560 870 680 

TCAGGA77CGA7CTCCAG7AAACTGAAGGAA7GC7GTGAAAAACC7C7GTTGSAAAAA7CCCAC 

QDSIS S XifXrCCS.XPLIfSX S H C Z A E V 

890 900 910 520 .930 940 550 960 

/JvAA7GA7GACA7GCCTGCTGAC77GCC77CA7TAGC7GC7GA7TT7G77GAAAG7» 

ZND2MPADLPSLAADPVZSXDVCXNYA 

970 980 990 1000 1010 1C2C «C30 1040 

GAGGCAAAGGA7C7C77CC7GGGCA7S77777G7A7GAA7A7GCAAGAAGGCA7CC7GA77AC7C7G7CG7GC7GC7GCT 



460 470 480 

:CGGAAC7CC7777C777GC7AAAAGG 
PZ L I# F ? A X R 
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riGVgg 2 Cor.- . 

1050 1060 1C70 1080 109C 

JCCAAG ACA 7 A7G AAACCA C7C7 AG AG AAG7 GC7G7 



RZ.AX7Yr771ZXCCAAAD 




1130 1140 1150 1 160 117C 1 1 8C * "-9C 120G 

7CGA7GAA777AA^CCTC77G7 GGAACAGC:rCAGAA777AA7 C^ 

-dztxplv 'I i ? 5 n I i x q n c z z ? z ; z c- z 




1210 1220 1230 1240 1250 1260 !27C :2SC 

TACAAATTCCACAATGCGCTATTACTTCGTTACACCAAGAAAGTACC^ 
YKrQNALLVRY TXKV? QvSr?7lvrvS 

1290 1300 1310 1320 1330 1340 135C 1360 

AAGAAACC7AGGAAAAG7GGGCAGCAAA7G77G7AAACA7CC7GAAGCAAAAAGAA7GCCC7G7GCAG^ 

?. NLGXV65XCCXH?SAXRM?CAZST1 

1370 1380 1390 1400 1410 1420 1430 

CCG7GG7CC7GAACCAG77A7G7G7G77GCA7GAGAAAACGCCAG7AAG7GACAGAG7CACAAAAT 
5VVLNQLCVLHZX7?VSDRV7XC 

1450 1460 1470 1480 1490 150C 15'C 1520 

77GG7GAACAGGCGACCA7G C 77T7CAGC7C7GGAAG7CGA7GAAACA7ACGT7CCCAAAGAC777AA7GC7GAAACA7T 
LVNRR?C7SALZVDZTYVPKZ?NA Z 7 T 

1530 1540 1550 1560 1570 1580 159C 1600 

CAC C77 C CA7GCAG AX A 7 A7 G CACAC777C7 GAG AAGG AG AG ACAAA7 CAAG AAA C AAAC7GCAC77 C77 GAG C77G7G A 
77EADICTX.SZXZRQZXXQTAL7ZLV 

1610 1620 1630 1640 1650 1660 1670 1680 

AACACAAGCCCAAGGCAACAAAAGAGCAAC7GAAAGC7G77A7GGA7GA77TCGCAGCTT77G7AGAGAAG7GC7GCAAG 
XHXPKA7XZQLXAVKDDZAAFV2XCCX 

1690 1700 1710 1720 1730 1740 1750 1760 

GC7GACGA7AAGGAGACC7GC777GCCGAGGAGGG7AAAAAAC77G77GC7GCAAG7CAAGC7GCC77AGGC77A7AACA 
ADDXZ7CZAZZGXXLVAA5 QAA1GL 



1770 1780 
7C7ACA777AAAAGCA7C7CAG 



SUBSTITUTE SHEET 



WO 90/13653 



TIC'JTS j Constructive cf =K05 i 6 



5/18 



PCT/GB90/00650 




WO 90/13653 



6/18 



PCT/GB90/00650 




SUBSTITUTE SHEET 



WO 90/13653 



PCT/GB90/00650 



7/18 < 

01 



Ov On Ol 0>.0>,0>»0«S O— OKI OC Q 3 O* OQ. 0+> O*. Oa Oc Or Ol oc 
© © cl nP —a ~0 —ID — < -> «\JU w< wi3 cv< cu< «nx t^a to< (ni5 >»< 



Pro 


Leu 


«0 

_r 


4? < 


5 


Leu 


Ala 


Met 


Arg 


Asp 


10 


Arg 


Thr 


Gin 


Thr 


Thr 


L. 
0) 

to 


Thr 


Asn 


Gin 


cn 


L 

P 


L. 

0) 
CO 


>> >> 
CD CD 


CO 


t? 


m 


-p 

G) 

2: 


c 
< 


«o 

2P 


to 


to 


*> 


c/l 
>k 


c 


< 


L> 


c 

0 


cfl 
X 


u 

r- 


1. 
CO 


L. 

h 


0 


7S « 
5 < 


L. 


I 


8- 

<r 


a 

L. 

r- 


cn 
< 


to 


CL 

r^ 


CL 
CO 
< 


*0 

>> 
O 


C 

to 
C 


i_ 

H 


0 


i_ 

r- 


1 


c 
tn 


L. 


c 




<o 

tl? 


< CD 


L. 

a) 
CO 


tO 

<3> 


0) 

a 


>i 
0 


L 

0) 
CO 


L, 
5 

to 


3 
CD 


i 




>i 
CD 




c 

< 


L 

(V 
LO 


3 

0 


c 
to 

< 


>^ 

0 


& 

CO 


CD 


< 




c 

CD 


3 
0 


5, 


c 

CD 


£ 


a 


» 
CD 




>> 

CD 


c 
tD 


< 


L. 


10 


I 


L. 
T" 


tO 
O 


I 


a 
£ 


Q. 




>> 
CD 


3 


w 








cn 

< 


8 

cx 






L_ 




a 


L. 


3 

s 


a 


c 

< 


c 

5 


>1 

CD 


f * 


>> 
CD 


i 


3 
O 


0 
a. 


1 


t 


CD 


CD 






. 3 
0 


x: 
a 


•3 


X 


i 


to 

2 


1 


c 

0 


CO 

< 


£ 5, 

« o 


3 

CD 


P 


< 




0) 

M 


>> 
CD 


1 


& 

to 






C 

0 


O 


to 

X 


1 


I 


i 


Pro 


n 


u 
0) 
CO 


l. a 
£ in 

r- < 


in 

X 


>> 

CD 




3 

CD 


F 
< 


OJ 
1— » 


CD 


CD 


0 

A. 

a 


a 

L 


10 

<? 


3 

0) 


7» 

0 


I. 

r 
H 


a 


C 

to 

< 


to 


<o— 

»— t 




01- oa o<n o>,po oap>,opiQt 
=5»2f Kfc 25 si S!£ 


Ol 
too 

CM CO 


^£ 

COCD 


KitD 


cryo 


0_ Oa o»o 

{R5 82 5^ 


COX 




Gin 


c 
0 


0 


m a> 
2P = 


F 

< 


h 


cn 

2P 


L. 

P. 


CO 


u 


v> 


im 

to 


0 
i. 

CL 


1S 

X 




tO 
t? 


c 
CD 


£ 


•0 
U 


f 


Pro 


L 


L, 


s- « 

<r x 


c 


3 
CD 


& 


3 
CD 


CD 


0) 

to 


cv 

1-4 




to 

X 


>1 

0 


0 


0 

L 

a 


E 1 
< 




3 


Ol 

L 
< 


c 
0 




to 


a) >- 

X 0) 

a 10 


CO 
< 


i0 
X 


r- 


>> 
tD 


3 • 

5 


L 

F 


CP 


u 
r- 


0 

L. 

a 


1 


55 
< 


3 
0 


>^ 
0 




< 


0 






H 






0 

L 

a 


a 


1 


>i 

CD 


a> 

u 

< 


c 

0 


c 
0 


c 
CD 


L 
0) 

CO 


0 


>> 
0 


3 
0 


to 


Ul 


3 
0 


Met 


>> 
0 


(A 








3 

CD 


5 


3 
—J 




3 

3 


5 


e 

a 


u 


3 
3 


C 

to 


L. 

r 


C 

0 


1 


a) 
to 



5l^5acT<or^d , <*<?!o5o , «nH<(nr- 

O4!_)a><<nr-_l{n>-0<r-r->r-<OO<CJ 
OLi<ar ? <<<«J<<<< < «0-J Or- r-O r- 



SUBSTiTU"! SHEET 



WO 90/13653 PCT/GB90/00650 



8/18 



CD 
in 



cn 



9> 




Ou O© 

cd>> or 


100 100 


mo 


So 

COL. 


o«n 


0- 
i9> 


v9_i (90 


O- 


Oqj 

ft= 


OCO 
N< 


2f= 

«"X 
Si- 


Ou Oo 

is>) cot: 


oa o c 

O ft- CM— VT— 

<DH <D0 CD0 


e? 


+1 

to 


< 


ft. 

|£ 


P 
< 








a 


L. 
O 

in 


0 


L- 

0 


L. 

(5 


3 
0 


0) 
X 

a 


ft. 
Q) 

cn 




< 


C71 
ft. 
< 


ml 

0 


3 

Q) 
1 


1— t 


a 




a 


0 


QJ 
X 
CL 




ft. 

0) 
10 




c 

1 


if* 


tfi 
X 


C 
< 




>^ 

5 


8 

CL 


c 
< 


X 


I 


1 


Cl 
CO 


15 


*2 

X 


«o 


C 


c 

0 




ft. 


k. 

a) 

co 


c 

•— « 




QJ 


>j 
0 


0) 

cn 


1 


k. 
cn 




I 


X 
H 


1 


cv 

tA 


a) 

CO 


J 
CD 


c 

5 


A. 

X 

h- 


I 


>> 

o 




c? 


1. 

0) 

CO 


ft. 

a 


O 
ft. 

a 


ft. 


k. 

P 


X 


a3 

CO 


I 


a 

in 

<: 


0) 


C 
0 


a> 


o 

ft. 

CL 


S 
—j 


co 
n: 




a 


c 


a> 

X 

a 




c 


0 
u 

LL 


X 




ft. 


C 
0 


I 


•j 
0 


X 


3 


L. 


QJ 

cn 


a) 
CO 


cy 

CO 


X 

H 


< 


a 
< 


5 






5 


L 

P 


L. 

? 


l. 
m 


it 




C 

0 


0 
ft. 
CL 


Q» 
CO 


Q. 






X 


L. 

X 




$ 


<: 


a 
£ 


>» 
5 




& 
i/) 


cn 

ft- 
< 


ft- 


1 


m 
l. 
< 




UJ 


ft_ 


ft. 
x: 
\- 


ft. 

<n 


.c 

0 




ft. 
cy 
to 


CL 

.< 




ft. 
o 

CO 


jj 
o> 
X 


c 

o 


k 


0) 
»-» 


> 


8* 


I 


C 

CD 


o 

i. 




G> 
, ,t 


0) 

in 


ft. 

a) 
tn 


< 


(0 

< 


0 
ft. 

a 


>» 

UJ 


0) 

_i 


a 

in 
< 


sV 
♦— « 


c 


U 


a 


>» 
5 


a 
< 


n 
< 


c 
5 


0 


D 
0) 
-i 


0 


QJ 
»— • 


tO 
X 


CO 


w. 

X 
H 


1 


0) 
CO 


3 

5 


0 
ft. 
£L 


Q) 
•— i 




cn 
k. 
< 


cO 
< 


OlO 


o>* or oa 

§5 52 52 


S3 


OtA 

B5* 


&5 Re 

inx ma. 


Ol 
tf>in 


O L. 


ft— 

td0 


03 


Oj- 

c9in 


03 
v9-l 


O-: 


oa Od 

rniQ lOQ) 
S< |^-J 


03 

^_I 


r^0 


Oft- 




0) 
X 

a 


»-« 






13 
x 


C 
0 


1 


c 

0 


0 

i>. 

CL 




1 


i. 

0 


P 


0 

a 


a 


>i 
0 


3 

QJ 
_J 


u 
cy 

CO 


CL 
< 


0 


3 

0 


>\ 
0 


cn 

ft- 
< 






X 


CL 
wo 
< 


ft. 


co 
5* 


ft. 


ft. 
ft) 

LO 




0 


X 

H 


0) 

cn 


a 

cn 


3 
0 


a 


c 
0 


I 


ft. 

|5 


0 
L. 
Cl 


| 


ft. 


u 


m 
>» 
o 


>> 

0 




cn 

IT 


X 


3 

0 


0) 
• 


L 
X 

• H 


3 
0 


ft. 

X 


ji 

CL 


1 


3 
0 


0 

i. 

CL 


3 

0 


k. 


01 


3 

OJ 

-J 




■s 

r 




c 

o 


3 


£ 

Cl 


3 

CD 




i- 


<rt 
X 


0) 

< 


L. 


Q) 
X 
CL 


0 
L. 
Cl 




i_ 

a 
if) 




>> 
0 


0 
L. 
a 


0 

ct 


c 

4? 


c 

0 


1 


in 


a 


3 
0 


a 


CL 


0 


Q> 
»— • 


. ft. 

A) 

tn 


'3 
0 




a 

CA 

< 


l. 


Q) 


3 
ii 


c 


Cl 


a 
* 


(0 

< 




a 

ifl 
< 


0 


p 


< 


X 


in 


&. 

0) 

CO 


>l 
0 


CV 


0 
k. 
a 








L. 


cn 


3 
0 


1 


3 

0 


0 

ft. 
a 


ft. 

0 


3 
0 


< 


3 

5 


* 




p 

< 


ir 


Q. 




1 


c 

0 


Cl 
l. 
h- 


0 


< 


3 
0 


<n 


L 


ft. 

0) 

cn 


ft. 

a> 
CO 


0 
ft. 
Cl 


0 

ct 






S 
< 


-P 


n 








>> 


3 


0 


m 

u 


0 
ft. 








3 


A- 

X 


QJ 




p 


i. 


• < 


<V 


0 






0 


0 


LU 


CL 


< 


a 




0 


< 


O 




i— i 


< 




co 


ft. 




X 


& 
tn 


X 






f 


1 


o 
< 


0 


en 

2T 


1 






1 


< 


c 

0 


a 
tn 
< 


ft. 

a) 

CO 


u 
OJ 
CO 



z 

CL 



WO 90/13653 PCT/GB90/00650 



9/18 



2° 

CDO. 



ocn 03 



o o- om 
(M3 <h> <h< 



OL 

CD a 



8- O- 
8§ 8^ 



i i 



<n 

o 
c 
0 

i 

3 
0 

2 
0 

< 



x 



s 

CL 



5 

a 
< 



0 
u 

2 

CL 



a 
Z 

L. 
I— 

I 

m 



•n 
I 

3 

0 

2 

CL 

3 
QJ 



e 

a 



3 
0? 



X 
3 



8 

ea- 
rn 



L. 
X 

K 

3 

c 

I 



O0 0«~« 00- 
p* 



5 5 - 

j Ji H 

P To -3 

< 5 < 



0) 

cn 
0 



c 

tn 



0) 

in 

P 
< 

o 

L. 

a 

nu o— 
CDI — ff» 

2> a- 



0 

L. 

X 

2 >> 
cd a 

(A 

5 



3 
O 

P 
< 



5 



s » 8* 

o\> <na omo 

t 



2 £ 



L. 

c 
5 

I 

>\ 

o 

i- 
a 



3 3 
0 5 



5 
1 



x 

CL 

c 
o 

3 

o 
in 
< 



> 

p 

< 
5 

X 

1 

< 



1 1 



p 

< 



0) 



0) 



o 

%m 



3 

0> 



ip 

0) 
X 

CL 



*■ 

0) 
X 

a 
c 

5 

3 

3 

C 

in 

O >- 

2 

a 

3 
-J 

L- 

X 



>1 
0 

X 



O 
u 
Cl 

>\ 
CD 



5 

> a 
J 0 



CD 

L. 

X 

H 
a 

(0 

< 



tn 



3 

a) 



- 1 
•-« > 



cr 
in 

u 



0 

< 

Oa> o c 

O ~gl 

a < 
o 

L. 

a 



to x 
> h- 



u r ^ 
8*5 KX 



OlD 0> Oh 
^ 3 
ID 



L> 



O— Ou O- 03 
3 *" 

5 
on 
< 

o 

<r 

3 

>^ 
O 
>> 

o 

c 
ID 

2° Ol 

8t ^ 

^P^"5 
< > 



5 



3 
CD 

I 

CD 
o 
a 

i_ 
x 
H 

3 
0) 



5 



5 



Q) o 

^ a 

° ^ 

a < 



X 



0) 



Ocn Ol QO 

(nj_j cuh oja 
~o 



s 1 



5 



< 



< 



CD 
x 



»— 

a: 



a 3 

in — 
< 0 



0) 2 



L. 

X 

H 

L, 



0) 

in 

5 



o 

L> 

a 
o 



3 

< 
_cv 

QJ 



a 
o 

L. 

a 

< 
a 

x 

I 
P 



8 



2 Pen 
P J3 Ui 



a 
in 
< 

Eg 23 



x 



9 

a 



x 
I- 



c 

CD 

F G 

P P 
< < 



P 



« 5 



L. 

X 



C 

ID 
c 
5 



(0 

> 



3 

CV 



L 

r 



P 
< 

L. 

X 
3 



< 

o 

u 

a 
c 
cD 

3 
0) 
-J 

c 
m 
< 

cn 

< 

3 

3 
o 
t 



x 

2 
a 

L. 
CV 

in 

3 

CD. 
c 

CD 

c 

2. 



5 
2 

cD 



c 

j 

L. 

2 

a 

e 

a 



3 L 

m Q) 

— 1 «n 

0) u 

x o> 

a <n 

^ a- 

o < 

£ & 

»— < m 

P ± 



c 

CD 

>^ 
0 

a 
m 
< 

cn 

L 
<I 

3 
-J 

I 

C 
0 



3 
0> 



5' 



X 

2 

a 
o 

u 

a 



3 

0) 



O 

L. 

a 



L_ 

X 



X 



QJ 

«n 

c 

3 



3 
0 

cuj 2!< 

a 



0 

2 
a 



QJ 

m 



p 

2 

0 

a 



c 

0 
C 
0 

0 



m 

1 

^ o 



x 



cv 

cn 

3 

C 
< 

a 
tn 
< 

0) 
X 

a 



x 
u 



S 
a 

1 

0) 

cn 

3 
0 



t 

CL 



0 



c 



0) 
X 

a 

cn 
k. 

< 

3 

0) 



a 
< 



WO 90/13653 



PCT/GB90/00650 



10/18 

o 

LTl 



I 


8* a? s* 

cn<5 co< io< 


(9 l_ 
n< 


oc 03 00 9P 
a>5 Oft> nl 2« 
«o< *a « a 


h 


©^Oftl WX «-?0-?<Dft) 

<*cd join Si- £0 jncD jom 


Ol 
o.c 


Or 0 cfl O >, 
\SO <9tD 


Ol 

Oft) 
tfitO 


CL 

a 


i 




k. 

X 

r- 


ft) 


cn <0 
< < 




< 


2 

CL 


0 

L. 

CL 


CO 


1 


s 


< 




L. 

X 

)- 


< 


3 
O 


to 




5 


? 


2 

CL 


ft> 


P 
< 


to > 


a 

$ 


a 
< 


a» 
to 


m 


< 


■ c 


s 




I 


3 


to 






0) 
-J 


O 


3 

<D 

. -J 


t 

h 


a 


l 


x ^ 


ft> 


a 


c 
Ifl 

< 


3 
ft) 
-1 


2 
a 


* 

X 


0 
u 

a 


tfl 


r* 




3 
ft) 
-J 


0 

L. 
CL 


I 




1 


c 

tfl 


i_ 

0? 

tn 


L. 

cv 
a) 


>^ 

0 


P »- 

£ 5! 


1 


L. 
01 
tO 


CD 


>i 
0 


k. 

V 

to 


c 
(D 


to 


1- 


3 
<D 


L» 

X 

r- 


i_ 
0) 

to 


i. 
X 




a 


in 


i 
t- 


3 


0) 

5 


u 
X 
r* 


1 I 


a. 
X 

r- 


ft) 
M 


>> 
tD 


& 

cn 


Ol 
tfl 
< 


L 
ft) 

to 


k 
to 


tfl 

2? 


1 


C 
CD 


L- 

r- 


<0 
> 


w 

0) 

to 


i? 


1 


■3 


X 


a 

J2 


ft) 


< > 


ft> 

tO 


3 


ir 
r- 


a) 
♦— » 


CD 


£ 

a. 


to 


*- 
X 

r- 




1 


O 
u 

a 


< 


k. 
0) 

to 


! 



i 5 5 = f 4 f 5 S o h < a o- 1 3 1- > m «r 

fc ■= 3 » <1 3 3 C fc^«>>&S-^ - 2 O EP " >, 

n 5 5 < 5 6 0 ^ o < o < fl 1 >< 4 1- 

LDLLoigii^ i^itf u2S-H = >,2fl 

J3< S§ Si <o£ io£ $5 5= 5a |# ££ Sis £^ mo £0 2«o <sh 2 o 2 < 



5 


Asp 

« 


to 


Ser 


Ala' 


Arg 


Pro 


Leu 


Thr 


He 


* 


Ala 


Thr 


5 


Asn 


Glu 


Glu 


2 
a 


f 




5 


Leu 




Ser 


Asp 


o> 


>i 
CD 


Thr 


Leu 


< 


Arg 


10 


&. 


Arg 


Ser 


to 

iP 


0) 


G 




Leu 


Asn 


3 


1 


Pro 


Val 


Leu 


Trp 


u 
ft) 

to 


Lau 


Pro 


Ala 


Tyr 


CD 


Val 


Tyr 


0) 

*-* 


Pro 


Thr 


Ser 


3 

0) 
-J 


Gin 


Ite 


Tyr 


1 


L. 
ftl 

tfl 


L 

cv 

to 


* 


His 


0) 
X 

a 


Asn 


Ser 


> 


Lp 

P 


Pro 


Thr 


Asn 


Ser 


Thr 


Met 


Pro 


Asp 


1 


3 
CD 


2P 


Thr 


lie 


Val 


Thr 


1 


tn 

X 


Thr 


3 

CD 


1 


Arg 






ft) 


Asn 


Thr 


3 
0 


Asn 


Thr 


5 
< 




Thr 


3 

3 


L. 

tn 


Val 




Thr 


3 

CD 


3 
-J 


3 

5 


3 

0 


Val 


Thr 


Thr 


te 
to 


CL 
to 


Thr 


Thr 


Gin 


Pro 


Pro 


X 


Ala 


Asp 


Leu 


L. 


c 
CD 


Phe 


Pro 


Thr 


Arg 


Leu 


Thr 


£ 

a 


Tyr 


ft) 


C 

O 


Val 


Gin 


Ala 


Ala 


Pro 


Pro 




lie 


Glu 


3 

ti) 


Arg 


Ser 


His 


0) 


>i 
CD 


Asp 


^ 


Glu 


Asp 


Pro 


Val 


Arg 


Asp 


p 


Pro 


Thr 


O 


Met 


Ser 


* 

< 




tD 


Asn 


X 


L 
V 
10 


Asn 


Arg 


Ala 


c 

cD 


Val 




Asp 


u 


Pro 


1 


0) 


Trp 


Thr 


3 



SUBSTITUTE SHEET 



WO 90/13653 



PCT/GB90/OO650 



11/18 



Thr 


Thr 


2 


Thr 


Asn 


Lau 


<r 


Pro 




Lys 


3 
CD 


Asp 


c 
CD 


o> 


Trp 


3 
CD 


Thr 


Val 


Asp 


Arg 




1 








t 








3 




0 


L. 


c 


o 

k. 


i- 

0) 




3 


m 

2P 


X 


3 






h 


< 






< 


< 




ID 


< 


a 




CD 


a 


01 








h 


0 


X 


1 


3 
0 


Asp 


W 


Asn 


Ser 


Arg 


Val 


Thr 


>» 


>» 
CD 


CD 


>i 
0 


Thr 


He 


Thr 


Gly 


HIS 


Pro 


Trp 


>y 
0 


>\ 

0 


Thr 


Val 


Arg 




0 
w 

a 


Pro 


3 

5 


2? 
(D 


lie 


X 


Pro 


1 


Ala 


Thr 


>i 
CD 


Thr 


Arg 


c 

0 


3 
0 


i. 
0> 
I/) 


C 

a 


Ala 


Gin 




Thr 


Ala 


£ 


Arg 


Pro 


3 
0» 
— 1 


Leu 


His 


Ser 


Thr 


Thr 


Val 


Leu 


c 
CD 


Asn 


Asp 


>t 
0 


Ala 


Asp 


Phe 


Asp 


Tyr 


Asp 


C 
CD 


Pro 


Glu 


Pro 


Asn 


Thr 


Pro 


Thr 


Gin 


Pro 


Thr 


Gin 


3 
0) 
-1 


>i 

0 


Phe 


Pro 


Thr 


>\ 
O 


Pro 


3 
0) 
-J 


QJ 

l-H 


Trp 


Pro 


Leu 


3 

ID 


Pro 




Gin 


Pro 


Ser 


Hfs 


Ala 


Asp 


>> 
0 


I 




tn 
< 


1 


Thr 


Lys 


Tyr 


Ala 


*• 




>> 
3 


is 

in 


His 


Phe 


Gin 


Pro 


Leu 




Ser 


Lys 


Glu 


Ala 


Leu 



SUBSTITUTE SHEET 



WO 90/13653 



PCTYGB90/00650 



12/18 



8!r, 9^ Q(0 ocn 9c o D 



Met 


Thr 


>\ 

0 


Arg' 


to 


Pro' 


tf 


Ala 


Leu 


cj 


Tyr 


Met 


C 

0 


0 


Tyr 


Asn 


Gin 


Phe 


Gly 


HIS 


Glu 


Asp 


c 

(A 
< 


Cys 


Asn 


Pro 


2? 




Tyr 


Glu 


Glu 


< 


Gin 


Arg 


Ser 


Q) 


0 


Cys 


Trp 


Trp 


Gin 


Pro 


c 
0 




Gin 


>> 

0 


>> 
0 


in 

e? 


Arg 


J* 

a. 


3 
0 


Arg 


Thr 


< 0 




2 * 
O 0 



2^ 
0 



ZP 



5 
< 



> 

X 

L. 

l> 

L. 
JZ 

r- 



0 0 

^ i? 
0 0 



c 



5 



* 4 £ 



3 



>1 

0 

a 



B 

Ql 

2 

0 



CO 

Q_ 
tn 
< 

0 

F < 

< < 
c 

0 



< 

c 

0 



u 1 # 0 :c 
c ^ (X en i_ — 



> U K M 



B 

CL 



< 



a 



LTl 



WO 90/13653 



13/18 



PCT/GB90/00650 



1 2 

i ' , — . 

GAAGAGCCTCkGAATTTAATC^ 

CTTCTCGGAGTCTTAAATTAGTGACTCTGAGGCTCAjGTCGGGTTGAGGG 

I . ; ! 

eepg nl it e| t p s qpn s h pi q w 

8 



AATGCACCACAGCCAXCTCACATTTCCAAGTACATTCTCAGGTGGAGAC CTAAAAATTCTG TA 
TTACGTGGTGTCGGXAGAGTGTAAAGGTTCATGTAAGAGTCCACCTCTGGATTTI^ 

napqpjshiskyilrwrpknsv 

7 



GGCCGTTGGAAGGAAGCTACCAIAC CAGGCCACTTAAACTCCTACACCATCAAAGGC CTG 
CCGGCAACCTTCCTTCGATGGTATGGTCCGGTGAAf^^ 

1 ; I 

gjrwk e a t i p g h 1 n sj y t i kg .l 

S 5 



Figure 6 Linker 5 showing the eight constituent oligonucleotides 
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Fig. 7 Construction of pDBDF2 
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Fig. 8 Construction of pDBDF5 



SUBSTITUTE: SHEET 



WO 90/13653 



PCT/GB90/00650 




Fig. 9 Construction of pDBDF9 
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Fig. 10 Construction of pDBDFI2 
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